ABSTRACT



Aims. We present CAIXA, a Catalogue of AGN In the XMM-Newton Archive. It consists of all the radio-quiet X-ray unobscured
($N_{\rm H} < 2 \times 10^{22}$ cm$^{-2}$) Active Galactic Nuclei (AGN) observed by XMM-Newton in targeted observations, whose data are public as of
March 2007. With its 156 sources, this is the largest catalogue of high signal-to-noise X-ray spectra of AGN.

Methods. All the EPIC pn spectra of the sources in CAIXA were extracted homogeneously and a baseline model was applied in 
order to derive their basic X-ray properties. These data are complemented by multiwavelength data found in the literature: Black Hole
masses, Full Width Half Maximum (FWHM) of H$\beta$, radio and optical fluxes.

Results. Here we describe our homogeneous spectral analysis of the X-ray data in CAIXA and present all the results on the parameters 
adopted in our best-fit models. 

Key words. Galaxies: active - Galaxies: Seyfert - quasars: general - X-rays: general 



1. Introduction 

Despite the unquestionable progress made in our understanding 
of the physics of Active Galactic Nuclei (AGN), there are still 
many open issues related to the X-ray properties of these
objects. Some of the most discussed ones in recent years are: the
highly debated nature of the soft X-ray excess in unobscured
sources; the different spectral and timing properties for different
classes of sources; the nature of radio emission in radio-quiet
objects and its relation to the X-ray nuclear emission; the origin
of the reprocessing of the primary emission from highly ionized
material; the correlation of the above-mentioned phenomenology
with fundamental properties of AGN, such as the Black
Hole (BH) mass and the accretion rate. The most effective way
to address these questions is to analyze large numbers of AGN
with good-quality X-ray spectra and to perform statistical studies,
taking into account data in other wavelengths and other basic
properties of the objects. 

The European Space Agency's (ESA) X-ray Multi-Mirror 
Mission (XMM-Newton) was launched on December 10th 1999.
The XMM-Newton public archive has since become the repos- 
itory of an enormous amount of high-quality X-ray data and a 
precious legacy for future missions. In particular, our knowl- 
edge of the physics of AGN has dramatically improved thanks 
to the large sensitivity of the European Photon Imaging Camera
(EPIC) pn charge-coupled device (CCD) arrays. At this time, a
systematic and homogeneous study of the EPIC pn spectra of 
AGN represents a necessary step to fully take advantage of this 
highly successful X-ray mission. 

In this paper, we present CAIXA, a Catalogue of AGN In 
the XMM-Newton Archive. This catalogue was already used by
Bianchi et al. (2007) to confirm with much higher confidence the
'Iwasawa-Taniguchi' effect (i.e. the anti-correlation between the
Equivalent Width of the neutral iron narrow emission line and
the X-ray luminosity) and by Guainazzi et al. (2006a; a smaller
catalogue was used at that time) and Longinotti et al. (2007)
to assess the frequency of the relativistic component of the iron 
line. Here we describe our homogeneous spectral analysis of the 
X-ray data in CAIXA and present all the results on the parame- 
ters adopted in our best-fit models. In following papers, we will 
present the timing analysis and properties of CAIXA and we 
will investigate the correlations between the X-ray and the mul- 
tiwavelength properties of the sources in the catalogue. 



2. The catalogue 

2.1. X-ray data reduction and spectral analysis

CAIXA consists of all the radio-quiet X-ray unobscured
($N_{\rm H} < 2 \times 10^{22}$ cm$^{-2}$) AGN observed by XMM-Newton in targeted
observations, whose data are public as of March 2007. Only
EPIC pn (Strüder et al. 2001) data were reprocessed, with
SAS v6.5 (Gabriel et al. 2004). For the observations performed
in Small Window mode, background spectra were generated using
blank-field event lists, according to the procedure presented
in Read & Ponman (2003). In all other cases, background spectra
were extracted from source-free regions close to the target in
the observation event file.

Source extraction radii and screening for intervals of flaring
particle background were determined via an iterative process
which maximizes the signal-to-noise ratio, similarly to what is
described in Piconcelli et al. (2004). Spectra were
binned in order to oversample the intrinsic instrumental energy
resolution by a factor not lower than 3, and to have spectral bins
with at least 25 background-subtracted counts. This ensures the
applicability of the $\chi^2$ statistics.
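
The minimum-counts part of the binning rule can be illustrated with a short sketch. It is not the procedure actually used for CAIXA: the function name `group_min_counts` is hypothetical, and the constraint on the oversampling of the instrumental resolution is deliberately left out.

```python
def group_min_counts(net_counts, min_counts=25):
    """Merge adjacent channels until each group holds >= min_counts net counts.

    Returns a list of (start, stop) channel index pairs, stop exclusive.
    Trailing channels that fall short are merged into the previous group.
    """
    groups = []
    start, total = 0, 0.0
    for i, c in enumerate(net_counts):
        total += c
        if total >= min_counts:
            groups.append((start, i + 1))
            start, total = i + 1, 0.0
    if start < len(net_counts):  # leftover channels below the threshold
        if groups:
            groups[-1] = (groups[-1][0], len(net_counts))
        else:
            groups.append((start, len(net_counts)))
    return groups
```

Merging the leftover channels into the last complete group is one possible choice; discarding them would be equally defensible.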

We applied a number of criteria to filter the catalogue. First
of all, spectra with fewer than 200 counts in either of the
(rest-frame) bands of 0.5-2 and 2-10 keV were rejected, because
they do not possess enough independent bins to be fitted with
our models. Moreover, spectra affected by a pileup larger than
1% were rejected. Only two objects, namely NGC 2992 and
ESO 548-G081, did not have an alternative pileup-free observation
and were therefore excluded from the catalogue. After the
fitting procedure, all sources with a local column density
(measured in the 2-10 keV band) larger than $2 \times 10^{22}$ cm$^{-2}$ were
further excluded from the catalogue.
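
The selection criteria listed above can be summarised as a single predicate. This is only a sketch: the function and argument names are hypothetical, and the treatment of values exactly at the thresholds is an assumption.

```python
N_H_LIMIT = 2e22  # cm^-2, upper limit on the local column density

def passes_selection(counts_soft, counts_hard, pileup_fraction, local_nh):
    """Return True if a spectrum satisfies the catalogue filters.

    counts_soft / counts_hard: counts in the rest-frame 0.5-2 and 2-10 keV bands.
    pileup_fraction: estimated pileup, as a fraction (0.01 means 1%).
    local_nh: fitted local column density in cm^-2.
    """
    enough_counts = counts_soft >= 200 and counts_hard >= 200
    no_pileup = pileup_fraction <= 0.01
    unobscured = local_nh <= N_H_LIMIT
    return enough_counts and no_pileup and unobscured
```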

In order to ensure that only radio-quiet objects were included
in CAIXA, we collected radio data at 6 cm (4.85 GHz) and
20 cm (1.4 GHz) from the literature and calculated the
K-corrected radio-loudness parameter $R$ ($\log R = \log f_{\rm 6cm} - \log f_B$;
Stocke et al. 1992) and the X-ray radio-loudness parameter $R_X$
($\log R_X = \log f_{\rm 6cm} - \log f_{2-10\,{\rm keV}}$; Terashima & Wilson 2003).
When only a 20 cm radio flux was available, it was converted
to 6 cm, assuming a power law radio spectrum ($f_\nu \propto \nu^{-\alpha}$),
where $\alpha = 0.7$ is the average two-point radio spectral index
for the sources in CAIXA with measurements at both 6 and 20 cm
(61 sources). The same procedure was followed for the (few)
843 MHz measurements. The optical flux $f_B$ was derived from the
V magnitude and the colour index B-V, the latter assumed to
be 0.3 if not available in the literature. According to the standard
definition, all quasars with $\log(R) > 1$ were excluded, while
for Seyfert galaxies, known to be on average 'radio-louder',
$\log(R) > 2.4$ and $\log(R_X) > -2.755$ were used as boundaries
(see Panessa et al. 2007). Note that three quasars are nominally
radio-loud, having values of $\log(R)$ of 1.09 (RXJ0136.9-3510),
1.01 (RXJ0228.2-4057) and 1.08 (KUV18217+6419). However,
given that the uncertainties on the radio and optical fluxes are not
taken into consideration, we are confident these sources can be
safely considered radio-quiet. Moreover, we did not find radio
measurements for the following sources: ESO 198-G24, RXJ0323.2-4931,
Fairall 1116, 1H0707-495, J124126.3-574955, H1846-786,
RXJ2241.8-4405, REJ2248-511. Some of them are treated
as radio-quiet sources in papers cited in Table 3 and, in any case,
the vast majority of them are Seyfert galaxies and unlikely to be
radio-loud with our criteria. Therefore, we kept all of them in
our final catalogue. Finally, with respect to the catalogue used in
Bianchi et al. (2007), we excluded PMNJ0623-6436, for which
we found a new radio observation at 6 cm (Sadler et al. 2006),
leading to $\log R \simeq 2$. Nevertheless, this exclusion does not affect
the results published in Bianchi et al. (2007).
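
As a rough illustration of the flux conversion and of the radio-loudness parameter, the sketch below extrapolates a flux density to 4.85 GHz with the quoted spectral index and evaluates log R. The function names are hypothetical, the K-correction is not included, and both fluxes are assumed to be given in the same units.

```python
import math

ALPHA = 0.7  # average two-point radio spectral index quoted in the text

def flux_at_6cm(flux, freq_ghz, alpha=ALPHA):
    """Extrapolate a radio flux density to 4.85 GHz, assuming f_nu ~ nu**-alpha."""
    return flux * (4.85 / freq_ghz) ** (-alpha)

def log_radio_loudness(f_6cm, f_b):
    """log R = log f_6cm - log f_B (no K-correction applied in this sketch)."""
    return math.log10(f_6cm) - math.log10(f_b)

def quasar_is_radio_loud(log_r):
    """Standard quasar boundary used in the text: log R > 1."""
    return log_r > 1.0
```

With these assumptions a 20 cm flux is reduced by a factor of about 0.42 when converted to 6 cm, and the same function applies to 843 MHz measurements by passing `freq_ghz=0.843`.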

At the end of this selection procedure, the total catalogue 
comprises 156 radio-quiet AGN. When more than one obser- 
vation of the same object was available, we used the one with 
the longest exposure, independently of its flux status, in order 
to avoid, as much as possible, any bias. Tables 1 and 2 summarize
the properties of the XMM-Newton observations included
in CAIXA, along with the fluxes, luminosities and main X-ray 
parameters derived from the fits with the baseline model. 

The baseline model consists of the following components:

$$F(E) = e^{-\sigma(E) N_{\rm H}^{G}} \left[ A\, e^{-\sigma(E) N_{\rm H}^{1}} E^{-\Gamma_s} + B\, e^{-\sigma(E) N_{\rm H}^{2}} E^{-\Gamma_h} + 4G \right] \qquad (1)$$

where $\sigma(E)$ is the photoelectric cross-section, adopting solar
abundances as in Anders & Ebihara (1982), $N_{\rm H}^{G}$ is the
Galactic column density appropriate for each source (after
Dickey & Lockman 1990), $N_{\rm H}^{1} < N_{\rm H}^{2}$ are two local column
densities at the redshift of the source, possibly coexisting if related
to absorption on different scales, $A$ and $B$ are two normalization
factors, $\Gamma_s$ and $\Gamma_h$ the spectral photon indexes for the soft
and the hard spectrum, and $G$ are four Gaussian emission lines at
rest-frame energies fixed to 6.4, 6.7, 6.96 and 7.06 keV, as
appropriate for neutral, He- and H-like iron K$\alpha$ and neutral iron K$\beta$,
respectively. The inclusion of the latter allows us to disentangle
its possible contamination to the 6.96 keV line, by not allowing
its flux to be larger than 0.16 times the flux of the neutral K$\alpha$ line
(Molendi et al. 2003). A relativistically broadened component of
the iron line was never included in our fits (see Bianchi et al.
2007 for a discussion on this point). Where the addition of the
second powerlaw leads to a $\chi^2$ improvement significant at less
than the 99% level (according to the F-test), we put $B = 0$ and the
baseline model only includes one powerlaw component. Finally, two
absorption edges at 0.737 (O VII) and 0.871 (O VIII) keV are included
when required, to roughly model a warm absorber (see also Sect. 3.1
and 3.2 for a discussion on this issue). No Compton reflection
component is included in our fits, since the limited bandpass of
XMM-Newton does not allow us to give good constraints on this
parameter, in particular when the SNR is not very large. We will
discuss the possible consequences of this choice on our results
in Sect. 3.2. In order to study the correlations where the soft
X-ray luminosities are involved, we decided to compile a
sub-catalogue, where all the sources with a column density in excess
of the Galactic one (but still lower than the general $2 \times 10^{22}$ cm$^{-2}$
limit of CAIXA) were excluded. This sub-catalogue comprises
142 sources and will be explicitly referred to when used in this
paper instead of the whole CAIXA.
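
The structure of the baseline model described above can be written down as a small function. This is a sketch only: the photoelectric cross-section is passed in by the caller as `sigma_abs` (no real cross-section table is reproduced here), the line width `line_sigma` is an arbitrary illustrative value, redshift handling and the absorption edges are omitted, and all names are hypothetical.

```python
import math

LINE_ENERGIES_KEV = (6.4, 6.7, 6.96, 7.06)  # Fe Kalpha (neutral, He-, H-like), neutral Kbeta

def gaussian(e, center, sigma, norm):
    """Gaussian line profile with integrated flux `norm`."""
    return norm / (sigma * math.sqrt(2.0 * math.pi)) * math.exp(-0.5 * ((e - center) / sigma) ** 2)

def baseline_model(e, sigma_abs, nh_gal, nh1, nh2, a, gamma_s, b, gamma_h,
                   line_norms, line_sigma=0.01):
    """Evaluate the two-powerlaw-plus-lines model at energy e (keV).

    sigma_abs: caller-supplied function returning the cross-section at e.
    line_norms: four line fluxes, in the order of LINE_ENERGIES_KEV.
    """
    s = sigma_abs(e)
    soft = a * math.exp(-s * nh1) * e ** (-gamma_s)
    hard = b * math.exp(-s * nh2) * e ** (-gamma_h)
    lines = sum(gaussian(e, c, line_sigma, n)
                for c, n in zip(LINE_ENERGIES_KEV, line_norms))
    return math.exp(-s * nh_gal) * (soft + hard + lines)
```

Setting `b=0` reproduces the single-powerlaw case adopted when the second component is not statistically required.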

All spectra were analyzed with XSPEC 12.2.1 (Arnaud 1996).
In the following, errors correspond to the 90% confidence
level for one interesting parameter ($\Delta\chi^2 = 2.71$), where not
otherwise stated. The cosmological parameters used throughout this
paper are $H_0 = 70$ km s$^{-1}$ Mpc$^{-1}$, $\Omega_\Lambda = 0.73$ and $\Omega_m = 0.23$
(i.e. the default ones in XSPEC 12.2.1).



2.2. Multiwavelength data and BH masses 

Table 3 summarizes the multiwavelength data collected for the
sources in CAIXA, i.e. the absolute magnitude (100% coverage),
the radio fluxes at 6 and/or 20 cm (95% coverage), the Full
Width Half Maximum (FWHM) of H$\beta$ (65% coverage), BH
mass (57%), V magnitude (100% coverage) and B-V colour
index (65% coverage). All the objects of the catalogue are included
in Véron-Cetty & Véron (2006), which represents the main
reference for the multiwavelength data. All the other references
are shown in Table 3. We stress here that the FWHM of H$\beta$ in
CAIXA are all larger than 650 km s$^{-1}$, as expected since it
includes only sources unobscured or moderately obscured in the
X-rays. Therefore, our distinction between narrow- and broad-line
objects simply reflects the standard limit of H$\beta$ FWHM <
2000 km s$^{-1}$, while no optical Type 2 objects are present in
CAIXA. Moreover, the standard boundary of $M_{\rm abs} = -23$ was
adopted to classify a source as a quasar or as a Seyfert galaxy.
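
The two classification boundaries can be combined into a single label, using the abbreviations that appear in the figure captions (NLSY, BLSY, NCSY, NLQ, BLQ, NCQ). The function name is hypothetical and the handling of values exactly on a boundary is an assumption.

```python
def classify(m_abs, fwhm_hbeta=None):
    """Return a label such as 'NLQ' or 'BLSY'.

    m_abs: absolute magnitude (quasar if brighter than -23).
    fwhm_hbeta: H-beta FWHM in km/s, or None if no measurement is available.
    """
    kind = "Q" if m_abs < -23.0 else "SY"
    if fwhm_hbeta is None:
        width = "NC"   # not classified
    elif fwhm_hbeta < 2000.0:
        width = "NL"   # narrow-line
    else:
        width = "BL"   # broad-line
    return width + kind
```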

2.3. General properties 

CAIXA includes a total of 77 quasars (16 narrow-line and 35
broad-line objects) and 79 Seyfert galaxies (21 narrow-line and
30 broad-line objects). The redshift distribution, which spans
from z=0.002 to z=4.520, is shown in Fig. 1: given the requirement
of a high SNR X-ray spectrum, most of the sources are
local (almost 90% within z < 1). Also shown in the same fig- 
ure are the BH masses distribution (1.4 x 10^ Mq < Mbh < 
2.0 X 10'° Mq): as expected, quasars have, on average, larger BH 

Fig. 1. General properties of the sources included in CAIXA. Redshift (top) and BH mass (bottom) distributions. Quasars (filled) and
Seyferts (dashed) are shown on the left, while narrow-line objects (red), broad-line objects (black) and objects without a measure of
the H$\beta$ FWHM (white) are shown on the right. See text for details.



masses and narrow-line objects the smallest ones, given the
correlation between the BH mass and the FWHM of the H$\beta$.

The distribution of the hard X-ray luminosities, which cover 
a range between L2-iokeV - 2.0 x 10"*' to 3.9 x 10'*'' erg s"', is 
shown in Fig. 2. The customary X-ray boundary between quasars
and Seyferts at $L_{2-10\,{\rm keV}} = 1 \times 10^{44}$ erg s$^{-1}$ seems to be in fair
agreement with the historical optical classification adopted in 
CAIXA, even if there is a larger number of X-ray weak quasars. 
Finally, the distributions of the Eddington ratios are shown in the
lower panels of Fig. 2: these values are derived from the 2-10
keV luminosities, adopting a luminosity-dependent bolometric
correction, after Marconi et al. (2004). The average Eddington
ratio is $\log(L_{\rm bol}/L_{\rm Edd}) = -0.79 \pm 0.08$ (standard deviation $\sigma = 0.71$)
for the whole catalogue, $-0.69 \pm 0.08$ ($\sigma = 0.55$) for quasars,
$-0.90 \pm 0.13$ ($\sigma = 0.86$) for Seyferts, $-0.56 \pm 0.13$ ($\sigma = 0.73$) for
narrow-line objects and $-0.84 \pm 0.08$ ($\sigma = 0.58$) for broad-line
objects. These values are in agreement with other samples and
catalogues (e.g. McLure & Dunlop 2004; Bian & Zhao 2004;
Netzer & Trakhtenbrot 2007), even if in CAIXA narrow-line
objects do not show, on average, very large values.
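
For reference, the Eddington ratio follows from the 2-10 keV luminosity once a bolometric correction is chosen. The sketch below takes the correction factor `k_bol` as an input rather than reproducing the luminosity-dependent prescription adopted in the catalogue; the function name is hypothetical, and the Eddington luminosity is the usual value for ionized hydrogen.

```python
import math

L_EDD_PER_MSUN = 1.26e38  # erg/s per solar mass

def log_eddington_ratio(l_2_10, m_bh_msun, k_bol):
    """log10(L_bol / L_Edd) for a 2-10 keV luminosity in erg/s.

    k_bol is the bolometric correction L_bol / L_2-10keV, supplied by the caller.
    """
    l_bol = k_bol * l_2_10
    l_edd = L_EDD_PER_MSUN * m_bh_msun
    return math.log10(l_bol / l_edd)
```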



3. Results 

3.1. Goodness of fits 

Fig. 3 shows the $\chi^2$ values obtained from the automatic fit
procedure of CAIXA as a function of the degrees of freedom (dof) of
each observation. The presence of a second powerlaw to model
the soft X-ray emission is statistically required in 123 spectra out
of 156 (see also Sect. 2.1): the single-powerlaw fits are marked
in orange in Fig. 3. As expected, they are clustered in the
low-dof part of the plot. The region where the adopted model can be

Fig. 2. General properties of the sources included in CAIXA. Hard X-ray (2-10 keV) luminosity (top) and Eddington ratio (bottom)
distributions. Quasars (filled) and Seyferts (dashed) are shown on the left, while narrow-line objects (red), broad-line objects (black)
and objects without a measure of the H$\beta$ FWHM (white) are shown on the right. See text for details.



considered correct at the 99% confidence level is also shown.
In other words, for each dof value there is a 1% probability
of getting a value of $\chi^2$ as large as or larger than the dot-dashed
curve plotted in the figure if the model is correct. Most of the fits
in CAIXA lie under this curve, confirming the good quality of
the derived spectral parameters. As an example, we show in Fig. 4
the best fit for Mrk 590 ($\chi^2$=228/229), which, with its 229 dof
(the maximum value allowed by the spectral binning criteria
described in Sect. 2.1), is one of the observations with the highest
SNR.
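
The 99% curve can be approximated without special libraries. The sketch below uses the Wilson-Hilferty approximation to the chi-square quantile, which is a swapped-in technique chosen for brevity and not necessarily how the curve in the figure was computed; the function name is hypothetical.

```python
from statistics import NormalDist

def chi2_critical(dof, confidence=0.99):
    """Approximate chi-square value exceeded with probability 1 - confidence.

    Uses the Wilson-Hilferty cube-root normal approximation, which is
    accurate to well below 1% for the large dof values considered here.
    """
    z = NormalDist().inv_cdf(confidence)
    t = 2.0 / (9.0 * dof)
    return dof * (1.0 - t + z * t ** 0.5) ** 3
```

For 229 dof the threshold comes out at roughly 280, so a fit with $\chi^2$=228 lies comfortably below it while one with $\chi^2$=2268 lies far above.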

However, there are a number of fits whose $\chi^2$ cannot be
considered statistically acceptable. Not surprisingly, most of them
correspond to very large dof values: high SNR spectra are more
difficult to fit properly with the simple model we are using
in CAIXA. As an example, we show in Fig. 5 the "best fit" for
NGC 3783 ($\chi^2$=2268/229), which, together with MCG-6-30-15,
is one of those with the poorest reduced $\chi^2$ in CAIXA. As apparent from
the left panel of Fig. 5, the main problem is represented by huge
residuals in the soft X-ray part of the spectrum, where the effects
of absorption from ionised matter are only roughly taken into
account by two absorption edges (see Sect. 2.1).

Fig. 3. $\chi^2$ values obtained from the automatic fit procedure of
CAIXA as a function of the degrees of freedom (dof) of each
observation. Orange points are those where the second powerlaw
is not statistically required. The continuous line refers to
$\chi^2$/dof=1, while the dot-dashed curve shows the $\chi^2$ value
corresponding to a 99% Null Hypothesis Probability for the corresponding
dof value. The vertical dotted line represents the 229 dof limit
derived from the spectral binning criteria in CAIXA. See text
for details.

The lack of a proper modelling of warm absorbers in CAIXA
is dictated by the complexity of these components, which does
not allow an automatic handling. But, since we are not extracting
information on this component in our catalogue, we should
only check how much the spectral properties we are interested
in are affected by this issue. In order to verify that, we fitted
the NGC 3783 data with the model adopted by Reeves et al.
(2004), which includes three phases for the warm absorbers,
a thermal black body for the soft excess, a powerlaw and a
Compton reflection component for the hard X-rays and the iron
lines. The resulting $\chi^2$ is acceptable (303/221) and comparable
with the one found by Reeves et al. (2004). The 0.5-2 keV
flux is $(16.06 \pm 0.02) \times 10^{-12}$ erg cm$^{-2}$ s$^{-1}$, while the 2-10 keV
flux is $(55.29 \pm 0.06) \times 10^{-12}$ erg cm$^{-2}$ s$^{-1}$. These values should
be compared to the ones automatically calculated in CAIXA:
$(15.34 \pm 0.12) \times 10^{-12}$ erg cm$^{-2}$ s$^{-1}$ for the 0.5-2 keV flux and
$(55.48 \pm 0.15) \times 10^{-12}$ erg cm$^{-2}$ s$^{-1}$ for the 2-10 keV flux. Therefore,
the difference in the soft X-ray flux is lower than 5%, while the hard X-ray fluxes
are consistent, within errors, and their nominal difference is of
the order of 3%. The EWs of the three iron lines are all consis- 
tent with the results obtained in CAIXA: 91+^ (93+^) eV, < 1 
(< 1) eV and 25 + 5 (17^^) eV, respectively for neutral iron, Fe 
XXV and Fe XXVI. 

We therefore conclude that, even when the statistical signif- 
icance of the best fit in CAIXA is very low, as in the case pre- 
sented above, the values of the X-ray parameters derived from 
the fit are still very reliable and can be safely used to prop- 
erly characterise the sources. The only parameter which can be 
significantly affected by the simplicity of the model adopted in 
CAIXA is the photon spectral index. In the case of NGC 3783, 
for example, the value derived from the automatic fit procedure
is very flat, being 1.49. The addition of a Compton reflection
component (with R=1, consistent with the neutral iron K$\alpha$
line, as suggested by Reeves et al. 2004) and a complex warm
absorber significantly steepens the powerlaw index to a more
canonical value, 1.76. We will discuss this issue in the next
section.



3.2. The X-ray spectral index 

The average 2-10 keV spectral index for the whole catalogue is
$1.73 \pm 0.04$, with a large spread of $\sigma = 0.45 \pm 0.03$. The hard
X-ray $\Gamma$ appears steeper in quasars ($1.80 \pm 0.05$, consistent with the
value obtained for the PG quasars: Piconcelli et al. 2005) than in
Seyferts ($1.66 \pm 0.05$). A Kolmogorov-Smirnov test yields that
the two populations are different at the 99% confidence level.
The difference is even larger if we consider narrow-line objects
($\Gamma = 1.94 \pm 0.07$) with respect to broad-line objects ($1.62 \pm 0.04$).
In this case, a K-S test gives a NHP of 2 x 10"^: the two popula- 
tions are significantly different with respect to their X-ray pho- 
ton indexes. It is important to stress here that the photon indexes 
quoted in CAIXA, in principle, are not the intrinsic ones,
because in our fits we do not consider the effect of Compton
reflection and warm absorption. Therefore, the reader should keep
in mind that the intrinsic photon indexes are likely to be steeper
than the ones reported here, as in the case presented in the
previous section. Indeed, when the Compton reflection component
can be well constrained, as in broad band BeppoSAX spectra, the
average spectral index is steeper ($1.89 \pm 0.03$ for Seyfert galaxies:
Dadina 2008). It is, therefore, possible that the different photon
indexes found for different populations in CAIXA do not only
(or at all) reflect differences in the intrinsic $\Gamma$ of the sources, but
also in the amount of Compton reflection.
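
The population comparisons above rely on the two-sample Kolmogorov-Smirnov test. A minimal pure-Python sketch follows, assuming the asymptotic Kolmogorov series with a commonly used small-sample correction for the null hypothesis probability; it is not necessarily the implementation used for CAIXA and the function names are hypothetical.

```python
import math

def ks_statistic(a, b):
    """Maximum distance between the empirical CDFs of two samples."""
    xs, ys = sorted(a), sorted(b)
    n, m = len(xs), len(ys)
    i = j = 0
    d = 0.0
    while i < n and j < m:
        v = min(xs[i], ys[j])
        while i < n and xs[i] <= v:   # step both CDFs past the value v
            i += 1
        while j < m and ys[j] <= v:
            j += 1
        d = max(d, abs(i / n - j / m))
    return d

def ks_null_probability(d, n, m, terms=100):
    """Asymptotic probability of a distance >= d if both samples share a parent."""
    ne = n * m / (n + m)
    lam = (math.sqrt(ne) + 0.12 + 0.11 / math.sqrt(ne)) * d
    if lam < 0.2:                     # series converges slowly; probability is ~1
        return 1.0
    total = sum((-1) ** (k - 1) * math.exp(-2.0 * (k * lam) ** 2)
                for k in range(1, terms + 1))
    return min(1.0, max(0.0, 2.0 * total))
```

The asymptotic formula is only approximate for very small samples, which is not a concern for groups of several tens of sources.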

In order to test this hypothesis, we performed a second run
of the automatic fitting procedure, with a model which includes
Compton reflection. Since the amount of reflection cannot be
constrained with the EPIC pn band coverage (and this is the
reason why it is not included in the default model adopted in
CAIXA) we fixed it in each source to a value tightly linked
with the EW of the neutral iron K$\alpha$ line. A linear correlation
between the two parameters was assumed, adopting $R = \Omega/2\pi = 1$
for EW=120 eV as normalization (see e.g. Matt et al. 1991).
Furthermore, in order to avoid the technical issues involved in
a fit with R linked to the EW of an emission line, we derived
the expected iron K$\alpha$ EW on the basis of the hard X-ray
luminosity, following the best fit for the Iwasawa-Taniguchi effect
(Bianchi et al. 2007):

$$\log(EW_{\rm Fe}) = (1.73 \pm 0.03) + (-0.17 \pm 0.03)\,\log(L_{X,44}) \qquad (2)$$

where $EW_{\rm Fe}$ is the EW of the neutral iron K$\alpha$ line in eV and $L_{X,44}$
is the 2-10 keV X-ray luminosity in units of $10^{44}$ erg s$^{-1}$. We,
therefore, introduce a luminosity-dependent Compton reflection
component. As for the default model in CAIXA, we added a
second powerlaw to model the soft X-ray emission, only if
statistically required.
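
The luminosity-dependent reflection fraction can be worked through numerically. The sketch below uses the best-fit coefficients of the Iwasawa-Taniguchi relation quoted above (without their uncertainties) and assumes the linear scaling passes through the origin, so that R = EW / 120 eV; the function names are hypothetical.

```python
import math

def expected_iron_ew(l_x):
    """Expected neutral Fe K-alpha EW (eV) for a 2-10 keV luminosity in erg/s."""
    l44 = l_x / 1e44
    return 10.0 ** (1.73 - 0.17 * math.log10(l44))

def reflection_fraction(l_x, ew_norm=120.0):
    """Reflection fraction R, normalised to R = 1 at EW = ew_norm (eV)."""
    return expected_iron_ew(l_x) / ew_norm
```

Under these assumptions a source with a 2-10 keV luminosity of $10^{44}$ erg s$^{-1}$ has an expected EW of about 54 eV and R of about 0.45, and both decrease towards higher luminosities.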

The distributions of the hard X-ray powerlaw index derived
for the model which includes Compton reflection are shown
in the bottom panels of Fig. 6. The average 2-10 keV spectral
index for the whole catalogue is, as expected, steeper, but
not significantly, being now $1.78 \pm 0.04$ ($\sigma = 0.46 \pm 0.03$).
Quasars ($1.83 \pm 0.05$) and Seyferts ($1.73 \pm 0.05$) are still
different at the 99% level, on the basis of a K-S test. Narrow-line
($\Gamma = 2.00 \pm 0.07$) and broad-line objects ($1.65 \pm 0.04$) still
display a very significant difference (NHP of 4 x 10"^ for the K-S
test). The inclusion of a luminosity-dependent Compton reflection
component, therefore, does not significantly affect the
distribution of the powerlaw indexes in CAIXA and, more importantly,
does not alter the differences between populations found
with the simpler model.

These mean values are 'censored', because they take into account
errors and upper limits on the measures, using the method described in
Appendix A.

Fig. 5. Best fit data and model for a bad fit in CAIXA, NGC 3783. The spectral parameters derived from this automatic fit are still
consistent with those extracted from a proper modeling of the data. See text for details.



Another important component which is not included in the
default model adopted in CAIXA is the warm absorber, which
is known to be present in many sources (see e.g. Piconcelli et al.
2004). If it is not properly modelled, it may result in a flatter
powerlaw index, as in the case of NGC 3783 reported in Sect.
3.1. However, a correct modelling of the complex (and often
multiple) warm absorption component is far beyond the scope
of CAIXA and will not be addressed further in the following.
Nevertheless, we would like to stress that most of the studies
on large samples of AGN adopt simple models, so that the hard
X-ray powerlaw index reported in CAIXA can be directly
compared to the ones found in the literature. On the other hand, it is
clear that the value of the intrinsic $\Gamma$ for single sources may
significantly differ from those found with more complex models,
but this is inherent in any statistical analysis of catalogues and
the effect is diluted by the large numbers of sources.



3.3. The iron lines 

The model adopted to fit the X-ray spectra in CAIXA includes
the K$\alpha$ emission lines from neutral iron, Fe XXV and Fe XXVI
(see Sect. 2.1). In Fig. 7 we show the distributions of the EW
for the three iron lines. The neutral iron line is detected in 81
sources, 46 of which are broad-line objects (out of the 65 in
CAIXA) and 18 narrow-line objects (out of the 37 in CAIXA).
The distribution is clustered around 50-120 eV, as expected if
arising as reprocessing from cold Compton-thick matter, such as the
putative torus (see e.g. Matt et al. 1991). Only 4 sources have
(detected) EWs larger than 250 eV, while the average (including
upper limits) value of the EW is $76 \pm 6$ eV, with quite a large
spread ($\sigma = 75$ eV). Such a large spread is also expected, since
the neutral iron K$\alpha$ EW is strongly anti-correlated to the X-ray
luminosity (the Iwasawa-Taniguchi effect: Bianchi et al. 2007).

These values are slightly different from the ones reported in
Bianchi et al. (2007), being 81, 43 and 19, because of the update of
the H$\beta$ FWHM values in CAIXA.

Fig. 6. Hard X-ray powerlaw index distribution in CAIXA, for the best fit model (upper panels) and the model which includes a
luminosity-dependent Compton reflection component (lower panels). On the left, dashed histograms are Seyfert galaxies, filled ones
are quasars. On the right, red histograms are narrow-line objects, black histograms are broad-line objects, while white histograms
are used for sources without a measure of the H$\beta$ FWHM in CAIXA. See text for details.



The Fe XXV line is detected in 44 sources, 26 of which are
broad-line objects and 11 narrow-line objects (out of the 37 in
CAIXA). Similarly, the Fe XXVI line is detected in 40 sources,
21 broad-line objects and 12 narrow-line objects. We therefore
do not find different rates of detection of ionised iron lines for
these two classes of AGN. These lines may be produced in a
Compton-thin gas photoionised by the AGN, but EWs larger
than $\simeq 100$ eV are quite difficult to explain within this
scenario (see e.g. Bianchi & Matt 2002; Bianchi et al. 2005). The
effect of the neutral K$\beta$ line which may, in principle,
contaminate the flux of the Fe XXVI line, is properly taken into account
in CAIXA. However, note that lines with such a large value of
EW are few in CAIXA and always with very large errors (see
Fig. 7, considering that EWs larger than 130 eV are detected
only in 4 and 2 sources, for Fe XXV and Fe XXVI, respectively).



In any case, we cannot exclude that an ionised accretion disc
may contribute to the flux of the ionised iron lines in the
objects where the measured EWs are quite large, as suggested, for
example, in Mrk 766 by variability studies (Miller et al. 2006).



Both the Fe XXV and Fe XXVI lines are detected in 24
sources, 15 broad-line objects and 6 narrow-line objects. We plot
in Fig. 8 the EW of one line against the other. The ratio between
the two EWs is in agreement with photoionised models, but on
average pointing towards large values of the ionization parameter
(Bianchi & Matt 2002). This is not surprising, given the fact
that these sources are those where both lines are observed and
thus the gas must be ionised enough to produce a large EW for
H-like iron. Column densities up to ^ 10^^ cm"^ are required to 
reproduce most of the observed EWs for He-like iron, while the 
column densities suggested by the Fe XXVI EWs are, on average,

Fig. 7. Distribution of neutral iron, Fe XXV (left) and Fe XXVI (right) measured (no upper limits) EWs in CAIXA. Red histograms
are narrow-line objects, black histograms are broad-line objects and white histograms are for objects with no measure in CAIXA
for the H$\beta$ FWHM. Very large EWs are not included in these distributions. See text for details.






Fig. 8. Fe XXV against Fe XXVI EWs for the objects in CAIXA
where at least one of the emission lines is detected (upper limits
for both lines are excluded). The diagonal lines show the
expected ratios between the two for three different values of
the X-ray ionization parameter (see Bianchi & Matt 2002 for
details). The dashed lines show the EWs expected for the two
lines if they are produced in a gas photoionised by the AGN (see
Bianchi & Matt 2002 for details). The different symbols refer
to the classification of the objects, on the basis of their absolute
magnitude and H$\beta$ FWHM: NLSY, narrow-line Seyfert 1;
BLSY, broad-line Seyfert 1; NCSY, not-classified Seyfert 1 (no
H$\beta$ FWHM measure available); NLQ, narrow-line quasar; BLQ,
broad-line quasar; NCQ, not-classified quasar (no H$\beta$ FWHM
measure available).



larger. This result is not uncommon: see Bianchi et al. (2005) for
a detailed discussion.



3.4. The soft excess 

As mentioned in Sect. 3.1, the presence of a second powerlaw
to model the soft X-ray emission is statistically required in 123
spectra out of 156. Although this component is clearly
required in the presence of neutral absorption of the
primary continuum (which is present only in 14 sources), in the
majority of the cases it reflects the presence of an excess in the
soft X-ray regime with respect to the intrinsic powerlaw
emission. Considering also that most of the single-powerlaw fits are
clustered in the low-SNR spectra, where the presence of a soft
excess is more difficult to detect, we can conclude that this
component is very common in CAIXA, in agreement with previous
studies (see e.g. Piconcelli et al. 2004).

The modelling of the soft excess with a simple powerlaw
has no physical justification. Indeed, the values of the soft X-ray
indexes found in CAIXA are often very steep (see Table 2) and
cannot be naturally explained. Moreover, as already shown by
several authors (Gierliński & Done 2004; Crummy et al. 2006),
the soft excess in unobscured radio-quiet AGN, when fitted with
a thermal model, is characterized by a temperature which is
remarkably constant across orders of magnitude of luminosities
and BH masses. We tested the same scenario in CAIXA, adopting
another fitting model, with a thermal black body component
instead of the soft powerlaw. The results are shown in Fig. 9:
even on a catalogue of more sources and which spans more
orders of magnitude in luminosities and BH masses than
previous studies, the temperature of the thermal emission appears
completely unrelated to the fundamental properties of the AGN.
Moreover, it was shown that, even for the same object, a
variation of the soft X-ray luminosity of one order of magnitude does
not produce any variation of this 'universal' temperature, in
contrast with what is expected from the basic physics of disk accretion
and black bodies (Ponti et al. 2006).

In the last few years, two main competing models have been
proposed to explain the origin of the soft excess, both ascribing
the apparent universal temperature of the soft excess to atomic
physics processes. In one scenario, the soft excess arises from
the enhancement of reflection from the inner regions of the
accretion disk due to light-bending effects, together with a strong
suppression of the primary emission (e.g. Miniutti & Fabian
2004). On the other hand, the soft excess can also be mimicked

Fig. 9. Temperature of the black body model, when used to model the soft excess in the objects of CAIXA. It appears completely
unrelated to the X-ray luminosity (left) and the BH mass (right). The different symbols refer to the classification of the objects, on
the basis of their absolute magnitude and H$\beta$ FWHM: NLSY, narrow-line Seyfert 1; BLSY, broad-line Seyfert 1; NCSY, not-classified
Seyfert 1 (no H$\beta$ FWHM measure available); NLQ, narrow-line quasar; BLQ, broad-line quasar; NCQ, not-classified quasar (no H$\beta$
FWHM measure available).



by absorption from a rel ativistically outflowing warm gas (e.g. 
iGierlinski & Donell2004 . Present X-ray spectra do not allow us 
to discriminate between the two models. 

Therefore, we decided to investigate the nature of the soft 
excess adopting a model-independent parameter. We chose the 
ratio between the soft X-ray luminosity (rest-frame 0.5-2 keV) 
and the hard X-ray luminosity (rest-frame 2-10 keV) as an 
indicator of the importance of the soft excess with respect to the 
overall X-ray spectrum of the source. As shown in Sect. 3.1, 
the luminosities extracted with the automatic fitting procedure 
in CAIXA are not significantly affected by the adopted model, 
so their ratio is a good parameter to characterize at least the 
basic property of the soft excess of the sources, i.e. its strength 
with respect to the primary X-ray emission. 

In Fig. 10 we show the distribution of the X-ray luminosity 
ratio in CAIXA. As already mentioned in Sect. 2.1, we stress 
here that this value is calculated only for sources with no neutral 
absorption, because the intrinsic soft X-ray luminosity is not 
measurable in absorbed sources. On the other hand, as already 
noted in previous sections, a warm absorber component is not 
modelled in CAIXA, so some of the calculated X-ray luminosity 
ratios may be lower than the intrinsic ones: in some objects, low 
ratios may not be an intrinsic property, but may instead indicate 
the presence of a strong warm absorption component. Although the 
majority of the sources cluster around a value of 1, narrow- and 
broad-line objects are significantly different populations with 
respect to the X-ray luminosity ratio, the NHP being 6 x 10"** for a 
K-S test. Indeed, it is very interesting to note that no narrow-line 
object is found with a value of the luminosity ratio lower than 
unity. In other words, narrow-line objects tend to have a strong 
soft X-ray luminosity with respect to their hard X-ray emission. 
We refer the reader to a forthcoming companion CAIXA paper 
for a detailed discussion of the correlations between the X-ray 
luminosity ratio and other multiwavelength properties of AGN, 
and their possible consequences for the origin of the soft excess. 
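The population comparison above relies on a two-sample K-S test. The text does not say which implementation was used, so the following is only a minimal sketch: it uses the standard asymptotic approximation for the null hypothesis probability, and the function name and the luminosity ratios in the usage lines are made up for illustration, not CAIXA values.

```python
import math


def ks_two_sample(a, b):
    """Two-sample Kolmogorov-Smirnov test.

    Returns (D, p): D is the maximum distance between the two empirical
    cumulative distributions, p is the asymptotic null hypothesis
    probability (an approximation, least accurate for small samples).
    """
    if not a or not b:
        raise ValueError("both samples must be non-empty")
    a, b = sorted(a), sorted(b)
    n, m = len(a), len(b)
    i = j = 0
    d = 0.0
    # Walk through both sorted samples and track the largest CDF difference.
    while i < n and j < m:
        x = min(a[i], b[j])
        while i < n and a[i] <= x:
            i += 1
        while j < m and b[j] <= x:
            j += 1
        d = max(d, abs(i / n - j / m))
    # Effective sample size and small-sample correction to the statistic.
    en = math.sqrt(n * m / (n + m))
    lam = (en + 0.12 + 0.11 / en) * d
    if lam < 0.2:
        # The series below converges slowly here; the probability is ~1.
        return d, 1.0
    p = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * k * k * lam * lam)
                  for k in range(1, 101))
    return d, min(max(p, 0.0), 1.0)


# Hypothetical soft-to-hard luminosity ratios (illustrative values only).
narrow_line = [1.1, 1.4, 1.8, 2.3, 2.9, 3.5]
broad_line = [0.4, 0.7, 0.9, 1.0, 1.2, 1.6, 2.0]
d_stat, nhp = ks_two_sample(narrow_line, broad_line)
print(f"D = {d_stat:.3f}, NHP = {nhp:.3g}")
```

A small NHP indicates that the two samples are unlikely to be drawn from the same parent distribution.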

Fig. 10. Distribution of the soft-to-hard X-ray luminosity ratio 
(L0.5-2 keV / L2-10 keV) in CAIXA. Red histograms are narrow-line 
objects, black histograms are broad-line objects and white 
histograms are objects with no FWHM measurement in CAIXA. 
See text for details. 

4. Conclusions 

We presented CAIXA, a Catalogue of AGN In the XMM-Newton 
Archive. It consists of all the radio-quiet X-ray unobscured 
(NH < 2 x 10^22 cm^-2) AGN observed by XMM-Newton in 
targeted observations, whose data are public as of March 2007. 
We performed a complete and homogeneous spectral analysis of 
the X-ray data, whose main results can be summarised as follows: 



- The X-ray spectral index. The average 2-10 keV spectral index 
for the whole catalogue is 1.73 ± 0.04 (σ = 0.49 ± 0.02) 
and appears significantly steeper in narrow-line objects and 
quasars with respect to broad-line objects and Seyfert galaxies. 
The average value for CAIXA and the differences between 
source populations do not significantly change if a 
luminosity-dependent Compton reflection component is 
included in the model. 

- The iron lines. The distribution of the neutral iron Kα EW 
is clustered around 50-120 eV, as expected if the line arises from 
reprocessing in cold Compton-thick matter, such as the putative 
torus. Fe XXV and Fe XXVI are commonly detected, but we 
do not find different rates of detection of these lines between 
narrow- and broad-line objects. The ratio between the EWs 
of the two lines is, on average, broadly in agreement with 
photoionisation models. 

- The soft excess. This component is very common in CAIXA, 
in agreement with previous studies. We chose the ratio be- 
tween the soft X-ray luminosity (rest-frame 0.5-2 keV) and 
the hard X-ray luminosity (rest-frame 2-10 keV) as a model- 
independent indicator of the strength of the soft excess. With 
respect to this parameter, narrow- and broad-line objects are 
significantly different populations. Indeed, it is very interest- 
ing to note that no narrow-line object is found with a value 
of the luminosity ratio lower than unity. 

On the basis of these results, we will investigate the 
correlations between the X-ray and the multiwavelength properties of 
the sources in CAIXA in a companion paper. 

Appendix A: Censored mean 

Given the large errors and the presence of upper limits for many of the 
parameters derived in CAIXA, most average values cited in this paper are 'censored 
means', unless otherwise stated. The adopted method is very similar to the 
'censored fits' presented in Bianchi et al. (2007), based on the linear fits performed 
by Guainazzi et al. (2006), and can be summarized as follows. A large number 
of mean values were computed on sets of Monte Carlo simulated data derived 
from the experimental points according to the following rules: a) each detection 
was substituted by a random draw from a Gaussian distribution, whose mean is the best-fit 
measurement and whose standard deviation is its statistical uncertainty; b) each 
upper limit U was substituted by a random draw from a uniform distribution in the interval 
[0,U]. The average (with its associated uncertainty) of the mean values of the 
single data sets is the 'censored mean' reported in the text, along with the average 
dispersion. 
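The procedure above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code used for CAIXA: the function name, the number of trials, the use of the scatter of the simulated means as the uncertainty, and the input values in the usage lines are all choices made here.

```python
import random
import statistics


def censored_mean(detections, upper_limits, n_trials=2000, seed=0):
    """Monte Carlo 'censored mean' of a mix of detections and upper limits.

    detections:   list of (best_fit_value, one_sigma_uncertainty) pairs
    upper_limits: list of upper limits U, each replaced by a draw in [0, U]
    n_trials:     number of simulated data sets (must be at least 2)
    """
    rng = random.Random(seed)
    means, dispersions = [], []
    for _ in range(n_trials):
        # Rule a): Gaussian draw around each detection.
        sample = [rng.gauss(value, sigma) for value, sigma in detections]
        # Rule b): uniform draw in [0, U] for each upper limit.
        sample += [rng.uniform(0.0, limit) for limit in upper_limits]
        means.append(statistics.fmean(sample))
        dispersions.append(statistics.pstdev(sample))
    return {
        # Average of the simulated means, with their scatter as uncertainty
        # (the choice of scatter as the uncertainty is an assumption here).
        "mean": statistics.fmean(means),
        "mean_err": statistics.stdev(means),
        # Average dispersion of the simulated data sets, with its scatter.
        "dispersion": statistics.fmean(dispersions),
        "dispersion_err": statistics.stdev(dispersions),
    }


# Hypothetical equivalent widths in eV (illustrative values only).
detected = [(60.0, 15.0), (95.0, 20.0), (110.0, 30.0)]
limits = [40.0, 75.0]
result = censored_mean(detected, limits)
print(result)
```

Fixing the seed keeps the example reproducible; in practice the number of trials would be increased until the reported values are stable.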
Table 1. Main properties of the observations of the sources included in CAIXA. 

Source  ObsID  T (ks)  F(0.5-2 keV)  F(2-10 keV)  L(0.5-2 keV)  L(2-10 keV) 

(1)  (2)  (3)  (4)  (5)  (6)  (7) 



Full table will be published in electronic form at the CDS. 

(1) Source name (Veron-Cetty & Veron 2006); (2) XMM-Newton OBSID; (3) Clean exposure time; (4) 0.5-2 keV flux (10^-12 erg cm^-2 s^-1); (5) 2-10 keV flux 
(10^-12 erg cm^-2 s^-1); (6) Unabsorbed 0.5-2 keV luminosity (10"*^ erg s^-1); (7) Unabsorbed 2-10 keV luminosity (10'''' erg s^-1). 

Table 2. Main X-ray parameters derived from the automatic fits of the observations included in CAIXA. 

Source  ObsID  Γs  Γh  EW6.4  EW6.7  EW6.96 

(1)  (2)  (3)  (4)  (5)  (6)  (7) 


Full table will be published in electronic form at the CDS. 

(1) Source name (Veron-Cetty & Veron 2006); (2) XMM-Newton OBSID; (3) Soft X-ray powerlaw index; (4) Hard X-ray powerlaw index; (5) 6.4 keV emission 
line EW (eV); (6) 6.7 keV emission line EW (eV); (7) 6.96 keV emission line EW (eV). 

Table 3. Multiwavelength properties of the sources included in CAIXA. RQ means the source is considered radio-quiet in the reference, but no radio 
flux is reported. 

NAME  RA  DEC  z  Mabs  F6cm  F20cm  Hβ  MBH  mv  B-V 

(1) (2) (3) (4) (5) (6) (7) (8) (9) (10) (11) 



Full table will be published in electronic form at the CDS. 

(1) Source name (Veron-Cetty & Veron 2006); (2)-(3): Coordinates in J2000 (NED); (4) Redshift (Veron-Cetty & Veron 2006); (5) Absolute magnitude 
(Veron-Cetty & Veron 2006); (6)-(7): Radio fluxes at 6 and 20 cm (Jy). References: 1. Veron-Cetty & Veron (2006), 2. Kellermann et al. (1989), 3. Gregory et al. 
(1996), 4. Condon et al. (1998), 4u. Sensitivity limit in Condon et al. (1998), 5. Kuhn et al. (2001), 6. Veron-Cetty et al. (1991), 7. Gallo et al. (2004), 8. 
Smith & Wright (1980), 9. Page et al. (2004), 10. Ganguly et al. (2006), 11. Mauch et al. (2003), 12. Yuan et al. (1998), 13. Wang et al. (2004), 14. Sadler et al. 
(2002), 15. . 16. McMahon et al. (1999); (8) Hβ FWHM (km s^-1). References: a. Boroson & Green (1992), b. Wang & Lu (2001), c. Vestergaard (2002), d. Wandel et al. 
(1999), e. Boller et al. (1996), f. Grupe et al. (1999), g. Grupe et al. (2004), h. Bian & Zhao (2003a), i. Peterson et al. (2004), j. Wang et al. (1996), k. Leighly (199^, 
l. Marziani et al. (2003), m. McIntosh et al. (1999), n. Winkler (1992), o. Ganguly et al. (2001), p. Corbin (1991), q. Corbin & Boroson (1996), r. Shemmer et al. 
(2006), s. Crenshaw et al. (2003), t. McLure & Dunlop (2001), u. Laor (2000), v. Reynolds et al. (1997), w. Zhou & Wang (2005); (9) Log of BH mass in units of 
solar masses. References: A. Woo & Urry (2002), B. Vestergaard & Peterson (2006), C. Vestergaard (2002), D. Zhang & Wang (2006), E. Wang & Lu (2001), F. 
Shemmer et al. (2006), G. Czerny et al. (2004), H. Laor (199i), I. Bian & Zhao (2003b), J. O'Neill et al. (2005), K. Merloni et al. (2003), L. Zhou & Wang (2005). 
Notes: r: reverberation mapping, o: Hβ FWHM, m: MgII FWHM, v: stellar velocity dispersion, n: narrow line region size; (10) mv (Veron-Cetty & Veron 2006); (11) 
B-V (mag) (Veron-Cetty & Veron 2006). 



